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AMENDMENTS TO THE CLAIMS 

Applicant submits below a complete listing of the current claims, including marked-up 
claims with insertions indicated by underlining and deletions indicated by strikeouts and/or 
double bracketing. This listing of claims replaces all prior versions, and listings, of claims in the 
application: 

1-13. (Canceled). 

14. (Currently amended) A system for providing transcription of a conference 
between two or more individuals a plurality of participants of the conference , the system 
comprising: 

a plurality of reception stages to receive speech information from the plurality of 
participants over a respective plurality of transmission channels; and 

at least one processor adapted to implem e nt capable of receiving the information from the 
plurality of reception stages, the at least one processor programmed to : 

analyze the information a channel recogniz e r coupled to the plurality of r e ception 
stages to receiv e spe e ch information, the channel recogniz e r r e cognizing which of th e 
received at the plurality of reception stages to determine which of the plurality of 
participants of the conference is speaking is receiving speech information during a given 
time interval based, at least in part, on identifying which of the plurality of reception 
stages is receiving speech information; 

select one of the plurality of transmission channels corresponding to the reception 
stage identified as receiving speech information as to identify an in-use channel of the 
plurality of transmission chann e ls and to; 

determine provid e channel information including at least one transmission 
parameter of the in-use channel; 

extract a f e ature vector extractor for extracting at least one feature vector from the 
speech information based, at least in part, on the channel information; 
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perform a s e gm e ntation recogniz e r for performing acoustic segmentation of the 
speech information to provid e generate acoustic segmentation information indicating at 
least one segment identified in the speech information based, at least in part, on the 
channel information and the at least one feature vector, the acoustic segmentation 
information including a label for the at least one segment of the speech information 
indicating whether the at least one segment is associated with speech, a pause in speech 
or non-speech; 

determine a language recognizer for det e rmining a language of the speech 
information based, at least in part, on the channel information, the at least one feature 
vector and the acoustic segmentation information; and 

generate a speech r e cognizer for providing text information corresponding to 
words recognized in the speech information based, at least in part, on the channel 
information, the at least one feature vector, the acoustic segmentation information and the 
language. 

15. (Previously presented) The system of claim 14, wherein the plurality of reception 
stages include at least two of the following: 

at least one sound card installed in at least one computer, the sound card connected to at 
least one microphone; 

at least one connection adapted to receive at least one analog telephone line; 

at least one connection adapted to receive at least one digital telephone line; 

at least one connection adapted to receive at least one Integrated Services Digital 
Network (ISDN) telephone line; 

at least one connection adapted to receive at least one data network channel; and 

at least one connection adapted to receive a voice-over-internet-protocol (VoIP) data 

stream. 

16. (Previously presented) The system of claim 15, wherein the channel information 
includes bandwidth information of the in-use channel. 
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17. (Currently amended) The system of claim 15, wherein the at least one processor 
is programmed to furth e r comprising a topic recognizer for r e cognizing recognize at least one 
key word in the speech information based, at least in part, on the language of the speech 
information, and wherein the speech recognizer provides the text information based, at least in 
part, on the at least one key word. 

18. (Currently amended) The system of claim 17, wherein the at least one processor 
is programmed to further comprising a speaker group recogniz e r for recognizing recognize a 
speaker group associated with the speech information based, at least in part, on the channel 
information and the language of the speech information, and wherein the speech recognizer 
provides the text information based, at least in part, on the speaker group. 

19. (Currently amended) A method of providing transcription of a conference 
between two or more individuals a plurality of participants of the conference , the method 
comprising: 

receiving sp ee ch information over a plurality of transmission channels from the plurality 
of participants ; 

analyzing the information received at recognizing which of the plurality of transmission 
channels reception stages to determine which of the plurality of participants of the conference is 
speaking is receiving sp ee ch information during a given time interval based, at least in part, on 
identifying which of the plurality of reception stages is receiving speech information to identify ; 

selecting one of the plurality of transmission channels corresponding to the reception 
stage identified as receiving speech as an in-use channel of the plurality of transmission 

determining providing channel information including at least one transmission parameter 
of that identifies the in-use channel; 

extracting at least one feature vector from the speech information based, at least in part, 
on the channel information; 
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performing acoustic segmentation of the speech information to provide generate acoustic 
segmentation information indicating at least one segment identified in the speech information 
based, at least in part, on the channel information and the at least one feature vector, the acoustic 
segmentation information including a label for the at least one segment of the speech information 
indicating whether the at least one segment is associated with speech, a pause in speech or non- 
speech; 

determining a language of the speech information based, at least in part, on the channel 
information, the at least one feature vector and the acoustic segmentation information; and 

generating providing text information corresponding to words recognized in the speech 
information based, at least in part, on the channel information, the at least one feature vector, the 
acoustic segmentation information and the language of the speech information. 

20. (Previously presented) The method of claim 19, wherein receiving speech 
information over a plurality of transmission channels includes receiving speech information via 
at least two of the following: 

at least one sound card installed in at least one computer, the sound card connected to at 
least one microphone; 

at least one analog telephone line; 
at least one digital telephone line; 

at least one Integrated Services Digital Network (ISDN) telephone line; 

at least one data network channel; and 

at least one voice-over-internet-protocol (VoIP) data stream. 

21 . (Previously presented) The method of claim 20, wherein the channel information 
includes bandwidth information of the in-use channel. 

22. (Previously presented) The method of claim 19, further comprising recognizing 
at least one key word in the speech information based, at least in part, on the language of the 
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speech information, and providing the text information is based, at least in part, on the at least 
one key word. 

23. (Previously presented) The method of claim 22, further comprising recognizing a 
speaker group associated with the speech information based, at least in part, on the channel 
information and the language of the speech information, and wherein providing the text 
information is based, at least in part, on the speaker group. 

24. (Currently amended) A computer readable storage device encoded with a 
plurality of instructions for execution on at least one processor, the plurality of instructions, 
when executed on the at least one processor, performing a method of providing transcription of a 
conference between two or mor e individuals a plurality of participants of the conference , the 
method comprising: 

receiving sp ee ch information over a plurality of transmission channels from the plurality 
of participants ; 

analyzing the information received at recognizing which of the plurality of transmission 
chann e ls reception stages to determine which of the plurality of participants of the conference is 
speaking is r e ceiving sp e ech information during a given time interval based, at least in part, on 
identifying which of the plurality of reception stages is receiving speech information to id e ntify ; 

selecting one of the plurality of transmission channels corresponding to the reception 
stage identified as receiving speech as an in-use channel of th e plurality of transmission 

determining providing channel information including at least one transmission parameter 
of that identifies the in-use channel; 

extracting at least one feature vector from the speech information based, at least in part, 
on the channel information; 

performing acoustic segmentation of the speech information to provide generate acoustic 
segmentation information indicating at least one segment identified in the speech information 
based, at least in part, on the channel information and the at least one feature vector, the acoustic 
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segmentation information including a label for the at least one segment of the speech information 
indicating whether the at least one segment is associated with speech, a pause in speech or non- 
speech; 

determining a language of the speech information based, at least in part, on the channel 
information, the at least one feature vector and the acoustic segmentation information; and 

generating providing text information corresponding to words recognized in the speech 
information based, at least in part, on the channel information, the at least one feature vector, the 
acoustic segmentation information and the language of the speech information. 

25. (Previously presented) The computer readable storage device of claim 24, 
wherein receiving speech information over a plurality of transmission channels includes 
receiving speech information via at least two of the following: 

at least one sound card installed in at least one computer, the sound card connected to at 
least one microphone; 

at least one analog telephone line; 
at least one digital telephone line; 

at least one Integrated Services Digital Network (ISDN) telephone line; 

at least one data network channel; and 

at least one voice-over-internet-protocol (VoIP) data stream. 

26. (Previously presented) The computer readable storage device of claim 25, 
wherein the channel information includes bandwidth information of the in-use channel. 

27. (Previously presented) The computer readable storage device of claim 24, further 
comprising recognizing at least one key word in the speech information based, at least in part, on 
the language of the speech information, and providing the text information is based, at least in 
part, on the at least one key word. 
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28. (Previously presented) The computer readable storage device of claim 27, further 
comprising recognizing a speaker group associated with the speech information based, at least in 
part, on the channel information and the language of the speech information, and wherein 
providing the text information is based, at least in part, on the speaker group. 



